MULTIPLE ERGODIC THEOREMS FOR ARITHMETIC SETS


NIKOS FRANTZIKINAKIS AND BERNARD HOST 


Abstract. We establish results with an arithmetic flavor that generalize the polynomial
multidimensional Szemerédi theorem and related multiple recurrence and convergence
results in ergodic theory. For instance, we show that in all these statements we
can restrict the implicit parameter n to those integers that have an even number of
distinct prime factors, or satisfy any other congruence condition. In order to obtain these
refinements we study the limiting behavior of some closely related multiple ergodic
averages with weights given by appropriately chosen multiplicative functions. These
averages are then analysed using a recent structural result for bounded multiplicative
functions proved by the authors.


1. Introduction and main results 

1.1. Introduction. The multidimensional Szemerédi theorem of H. Furstenberg and
Y. Katznelson [?], stated in ergodic terms, asserts that if $T_1,\ldots,T_\ell$ are commuting
measure preserving transformations acting on the same probability space $(X,\mathcal{X},\mu)$, then
for every $A\in\mathcal{X}$ with $\mu(A)>0$ there exists $n\in\mathbb{N}$ such that

$$\mu(T_1^{-n}A\cap\cdots\cap T_\ell^{-n}A)>0.$$

More recently, T. Tao [23] established mean convergence for some closely related multiple
ergodic averages by showing that for $F_1,\ldots,F_\ell\in L^\infty(\mu)$ the averages

$$\frac{1}{N}\sum_{n=1}^{N} T_1^nF_1\cdots T_\ell^nF_\ell$$

converge in the mean as $N\to\infty$. In this article we are interested in studying variants
of such statements where the parameter n is restricted to certain subsets of the integers
of arithmetic nature. For instance, we are interested in knowing whether the previous
results remain true when we restrict the parameter n to those integers that have an even
(or an odd) number of distinct prime factors. More generally, do they hold if we restrict
n to those integers that have a mod b distinct prime factors for some $a,b\in\mathbb{N}$?

We answer these questions affirmatively. In order to give some model results in this
introductory section (extensions and related statements appear in Section 1.2) we introduce
some notation. For $a,b\in\mathbb{N}$ we let $S_{a,b}$ consist of those $n\in\mathbb{N}$ whose number of distinct
prime factors is congruent to a mod b. It can be shown that for every $a\in\{0,\ldots,b-1\}$
the set $S_{a,b}$ has density $1/b$ (see the second remark after Proposition 2.10).
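
The sets $S_{a,b}$ are easy to explore numerically. The following is a minimal sketch, not part of the original argument: the helper names `omega_sieve` and `class_counts` are hypothetical, and the script only counts how many $n\le N$ fall in each class. For finite N the proportions only approximate $1/b$, and the approximation may be poor for larger b.

```python
def omega_sieve(limit):
    """Return a list w with w[n] = number of distinct prime factors of n, 0 <= n <= limit."""
    w = [0] * (limit + 1)
    for p in range(2, limit + 1):
        if w[p] == 0:  # no smaller prime divides p, hence p is prime
            for multiple in range(p, limit + 1, p):
                w[multiple] += 1
    return w


def class_counts(limit, b):
    """counts[a] = number of n in [1, limit] with omega(n) congruent to a mod b."""
    w = omega_sieve(limit)
    counts = [0] * b
    for n in range(1, limit + 1):
        counts[w[n] % b] += 1
    return counts


if __name__ == "__main__":
    N, b = 10**5, 2
    for a, c in enumerate(class_counts(N, b)):
        print(f"a={a}: proportion of n <= {N} in S_(a,{b}) is {c / N:.4f}")
```

The sieve adds 1 to every multiple of each prime, so `w[n]` is exactly the number of distinct primes dividing n.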

Theorem A. Let $T_1,\ldots,T_\ell$ be commuting measure preserving transformations acting
on the same probability space $(X,\mathcal{X},\mu)$. Then for every $A\in\mathcal{X}$ with $\mu(A)>0$ we have

$$\mu(T_1^{-n}A\cap\cdots\cap T_\ell^{-n}A)>0$$

for a set of $n\in S_{a,b}$ with positive lower density.

We deduce from this ergodic statement, via the correspondence principle of H. Furstenberg
(see Section 1.3), that every set of integers with positive upper density contains
arbitrarily long arithmetic progressions with common difference taken from the set $S_{a,b}$;
similar statements also hold for the multidimensional Szemerédi theorem, polynomial
variants of it (see Theorem 1.5), and for any shift of the sets $S_{a,b}$.

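Theorem B. Let $T_1,\ldots,T_\ell$ be commuting measure preserving transformations acting
on the same probability space $(X,\mathcal{X},\mu)$. Then for all $F_1,\ldots,F_\ell\in L^\infty(\mu)$ the averages

$$(1)\qquad \frac{1}{|S_{a,b}\cap[1,N]|}\sum_{n\in S_{a,b}\cap[1,N]} T_1^nF_1\cdots T_\ell^nF_\ell$$

converge in $L^2(\mu)$. In fact, the limit is equal to $\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}T_1^nF_1\cdots T_\ell^nF_\ell$.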

In order to analyze the averages (1) we do not use the theory of characteristic factors;
even for averages of the form $\frac{1}{N}\sum_{n=1}^{N}T_1^nF_1\cdots T_\ell^nF_\ell$ this theory is very intricate and not
yet developed to an extent that facilitates our study. Instead, we proceed by comparing
the averages (1) with the averages $\frac{1}{N}\sum_{n=1}^{N}T_1^nF_1\cdots T_\ell^nF_\ell$ and show that the difference
converges to 0 in $L^2(\mu)$. To do this, we work with some weighted multiple ergodic
averages with weights given by suitably chosen multiplicative functions. Then the asserted
convergence to 0 is a consequence of the next statement.

Theorem C. Let $f\in\mathcal{M}_{conv}$ be a multiplicative function (see definition in Section 1.2).
If $T_1,\ldots,T_\ell$ are commuting measure preserving transformations acting on the same
probability space $(X,\mathcal{X},\mu)$, then for all $F_1,\ldots,F_\ell\in L^\infty(\mu)$ the averages

$$(2)\qquad \frac{1}{N}\sum_{n=1}^{N} f(n)\cdot T_1^nF_1\cdots T_\ell^nF_\ell$$

converge in $L^2(\mu)$. Furthermore, the limit is zero if f is aperiodic (see definition in
Section 1.2).

Let us briefly explain how we derive Theorems A and B from Theorem C. For $b\in\mathbb{N}$
we let $\zeta$ be a root of unity of order b and let f be the multiplicative function defined by
$f(p^k)=\zeta$ for all primes p and all $k\in\mathbb{N}$. Note that

$$(3)\qquad \mathbf{1}_{S_{a,b}}(n)=\frac{1}{b}\sum_{j=0}^{b-1}\zeta^{-aj}\,(f(n))^j.$$

It follows from Proposition 2.10 that for $j=1,\ldots,b-1$ the multiplicative function $f^j$ is
aperiodic. Combining this with (3) and Theorem C we get that the difference

$$(4)\qquad \frac{1}{N}\sum_{n=1}^{N}\mathbf{1}_{S_{a,b}}(n)\cdot T_1^nF_1\cdots T_\ell^nF_\ell-\frac{1}{bN}\sum_{n=1}^{N}T_1^nF_1\cdots T_\ell^nF_\ell$$

converges to 0 in $L^2(\mu)$. Using this and the aforementioned convergence result of T. Tao
we deduce Theorem B. Furthermore, since the difference (4) converges to 0 in $L^2(\mu)$,
letting $F_1=\cdots=F_\ell=\mathbf{1}_A$ where $\mu(A)>0$, integrating over X, and using the
multidimensional Szemerédi theorem of Furstenberg and Katznelson, we deduce Theorem A.
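
Identity (3) is a finite Fourier expansion on the group of b-th roots of unity, and it can be checked directly. The sketch below is illustrative only: the function names are hypothetical, and it assumes that $\zeta=e^{2\pi i/b}$ (a primitive root) and that $f(n)=\zeta^{\omega(n)}$, where $\omega(n)$ is the number of distinct prime factors of n, which is what the definition $f(p^k)=\zeta$ gives by multiplicativity.

```python
import cmath


def omega(n):
    """Number of distinct prime factors of n (trial division)."""
    count, p = 0, 2
    while p * p <= n:
        if n % p == 0:
            count += 1
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        count += 1
    return count


def indicator_via_roots_of_unity(n, a, b):
    """Right-hand side of (3): (1/b) * sum_{j=0}^{b-1} zeta^(-a j) * f(n)^j."""
    zeta = cmath.exp(2j * cmath.pi / b)  # primitive b-th root of unity (assumption)
    f_n = zeta ** omega(n)               # f(n) = zeta^omega(n)
    return sum(zeta ** (-a * j) * f_n ** j for j in range(b)) / b


if __name__ == "__main__":
    b = 3
    for n in (1, 2, 6, 30, 210):
        row = [round(indicator_via_roots_of_unity(n, a, b).real) for a in range(b)]
        print(n, omega(n) % b, row)
```

The term $j=0$ contributes the constant $1/b$, which is why the second average in (4) carries the factor $1/b$.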

The proof of Theorem C depends upon a deep structural result for multiplicative
functions proved by the authors in [11]. Roughly speaking, it asserts that the general
multiplicative function that is bounded by 1 can be decomposed into two terms, one that is
approximately periodic and another that contributes negligibly to the averages (2). The
approximately periodic component vanishes if the multiplicative function is aperiodic.
In the general case, a careful analysis of the contribution of the structured component
allows us to conclude the proof of Theorem C.

We note that if one is only interested in the weak convergence of the averages (2), an
alternate (and arguably simpler) approach is to use a decomposition result for multiple
correlation sequences from [9]; we discuss this approach in more detail in Section 3.5.

1.2. Recurrence and convergence results. Our main results cover a vastly more
general setting than the one described in the previous subsection. In order to facilitate 
exposition we introduce some definitions and notation. We start with some notions from 
number theory related to multiplicative functions. 

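Definition. A function $f\colon\mathbb{N}\to\mathbb{C}$ is called multiplicative if

$$f(mn)=f(m)f(n)\quad\text{whenever } (m,n)=1.$$

We let

$$\mathcal{M}:=\{f\colon\mathbb{N}\to\mathbb{C}\ \text{multiplicative such that } |f(n)|\le 1 \text{ for every } n\in\mathbb{N}\}$$

and

$$\mathcal{M}_{conv}:=\Big\{f\in\mathcal{M}\colon \lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}f(an+b)\ \text{exists for every } a,b\in\mathbb{N}\Big\}.$$

We say that $f\in\mathcal{M}$ is aperiodic if $\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}f(an+b)=0$ for every $a,b\in\mathbb{N}$.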
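
To make these definitions concrete, the following sketch uses the Liouville function $\lambda(n)=(-1)^{\Omega(n)}$ (mentioned later in Section 1.4) as a test case; it is a standard fact that $\lambda$ is a completely multiplicative function in $\mathcal{M}$ that is aperiodic. The helper names are hypothetical and the code only evaluates the finite averages $\frac{1}{N}\sum_{n=1}^{N}f(an+b)$; it proves nothing about their limits.

```python
from math import gcd


def big_omega(n):
    """Number of prime factors of n counted with multiplicity."""
    count, p = 0, 2
    while p * p <= n:
        while n % p == 0:
            n //= p
            count += 1
        p += 1
    return count + (1 if n > 1 else 0)


def liouville(n):
    """Liouville function: +1 if Omega(n) is even, -1 otherwise."""
    return -1 if big_omega(n) % 2 else 1


def progression_average(f, a, b, N):
    """(1/N) * sum_{n=1}^{N} f(a*n + b)."""
    return sum(f(a * n + b) for n in range(1, N + 1)) / N


def is_multiplicative_up_to(f, bound):
    """Check f(mn) = f(m) f(n) for all coprime m, n <= bound."""
    return all(
        f(m * n) == f(m) * f(n)
        for m in range(1, bound + 1)
        for n in range(1, bound + 1)
        if gcd(m, n) == 1
    )


if __name__ == "__main__":
    print(is_multiplicative_up_to(liouville, 40))
    for a, b in ((1, 1), (3, 1), (4, 3)):
        print(a, b, progression_average(liouville, a, b, 5000))
```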


If a multiplicative function takes real values, then a well known theorem of E. Wirsing
[25] states that it has a mean value; furthermore it belongs to $\mathcal{M}_{conv}$. But there exist
complex valued multiplicative functions that do not have a mean value, for example if
$f(n)=n^{it}$ for some $t\neq 0$, then $\frac{1}{N}\sum_{n=1}^{N}f(n)=\frac{N^{it}}{1+it}+o_{N\to\infty}(1)$. Borrowing terminology
from [18], it can be shown that $f\in\mathcal{M}_{conv}$ unless $f(n)$ "pretends" to be $n^{it}\chi(n)$ for some
$t\in\mathbb{R}$ and Dirichlet character $\chi$. Necessary and sufficient conditions for checking when a
multiplicative function belongs to the set $\mathcal{M}_{conv}$ can be found in Theorem 2.9 below.
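
The failure of $n^{it}$ to have a mean value can be seen numerically: the averages track the rotating quantity $N^{it}/(1+it)$ instead of settling down. The sketch below is only an illustration with hypothetical function names; it compares the finite average with that main term for one choice of t.

```python
import cmath
import math


def mean_of_n_it(t, N):
    """(1/N) * sum_{n=1}^{N} n^{it}, with n^{it} computed as exp(i t log n)."""
    return sum(cmath.exp(1j * t * math.log(n)) for n in range(1, N + 1)) / N


def main_term(t, N):
    """The rotating main term N^{it} / (1 + it)."""
    return cmath.exp(1j * t * math.log(N)) / (1 + 1j * t)


if __name__ == "__main__":
    t = 1.0
    for N in (10**3, 10**4, 10**5):
        print(N, mean_of_n_it(t, N), main_term(t, N))
```

Since $|N^{it}/(1+it)|$ is constant while its argument $t\log N$ keeps growing, the averages move around a circle of radius $1/|1+it|$ and cannot converge.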

Next we introduce some notions from ergodic theory. 


Definition. • A bounded sequence of complex numbers $(w(n))$ is a good universal
weight for polynomial multiple mean convergence if for every $\ell,m\in\mathbb{N}$, probability space
$(X,\mathcal{X},\mu)$, invertible commuting measure preserving transformations $T_1,\ldots,T_\ell\colon X\to X$,
functions $F_1,\ldots,F_m\in L^\infty(\mu)$, and polynomials $p_{i,j}\colon\mathbb{Z}\to\mathbb{Z}$, $i=1,\ldots,\ell$, $j=1,\ldots,m$,
the averages

$$\frac{1}{N}\sum_{n=1}^{N}w(n)\cdot\Big(\prod_{i=1}^{\ell}T_i^{p_{i,1}(n)}\Big)F_1\cdots\Big(\prod_{i=1}^{\ell}T_i^{p_{i,m}(n)}\Big)F_m$$

converge in $L^2(\mu)$, where $\prod_{i=1}^{\ell}S_i$ denotes the composition $S_1\circ\cdots\circ S_\ell$. A set of integers
S is a set of polynomial multiple mean convergence if the sequence $(\mathbf{1}_S(n))$ is a good
universal weight for polynomial multiple mean convergence.

• A set of integers S is a set of polynomial multiple recurrence if for every $\ell,m\in\mathbb{N}$,
probability space $(X,\mathcal{X},\mu)$, invertible commuting measure preserving transformations
$T_1,\ldots,T_\ell\colon X\to X$, set $A\in\mathcal{X}$ with $\mu(A)>0$, and polynomials $p_{i,j}\colon\mathbb{Z}\to\mathbb{Z}$, $i=1,\ldots,\ell$,
$j=1,\ldots,m$, with $p_{i,j}(0)=0$, we have

$$(5)\qquad \mu\Big(\Big(\prod_{i=1}^{\ell}T_i^{p_{i,1}(n)}\Big)A\cap\cdots\cap\Big(\prod_{i=1}^{\ell}T_i^{p_{i,m}(n)}\Big)A\Big)>0$$

for a set of $n\in S$ with positive lower density.


Remark. All the statements in this article refer to sets of integers S with positive density,
hence there is no need to normalize the relevant averages. Furthermore, although we always
work under the assumption that the measure preserving transformations commute,
with some additional work our arguments extend to the case where the transformations
generate a nilpotent group; we discuss this in more detail in Section 3.4.

Our first result generalizes Theorem C from the introduction. 

Theorem 1.1. Let $f\in\mathcal{M}_{conv}$ be a multiplicative function. Then the sequence $(f(n))$ is
a good universal weight for polynomial multiple mean convergence. Furthermore, if f is
aperiodic, then the corresponding weighted ergodic averages converge to 0 in $L^2(\mu)$.

Remark. Examples of periodic systems show that one does not have convergence if
$f\notin\mathcal{M}_{conv}$. Nevertheless, following the method of [10] it is possible to show that for every
$f\in\mathcal{M}$ there exist $t\in\mathbb{R}$ and a slowly varying sequence $\eta(n)$ (meaning $\max_{x\le n\le x^2}|\eta(n)-\eta(x)|\to 0$
as $x\to\infty$), where both t and $\eta$ depend only on f, such that the corresponding
weighted ergodic averages multiplied by $N^{-it}e(-\eta(N))$ converge in the mean.

If S is the set of square-free integers, applying Theorem 1.1 for the multiplicative
function $f:=\mathbf{1}_S$ we deduce that S is a set of polynomial multiple mean convergence.¹
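
The only input needed in this example is that the indicator of the square-free integers is a bounded multiplicative function. The sketch below (hypothetical helper names, finite checks only) verifies multiplicativity on a small range and prints the proportion of square-free integers up to N, which is classically known to tend to $6/\pi^2$.

```python
from math import gcd, pi


def is_squarefree(n):
    """True if no square of an integer p >= 2 divides n."""
    p = 2
    while p * p <= n:
        if n % (p * p) == 0:
            return False
        p += 1
    return True


def indicator(n):
    """Indicator function 1_S(n) of the square-free integers."""
    return 1 if is_squarefree(n) else 0


def squarefree_proportion(N):
    """Proportion of square-free integers in [1, N]."""
    return sum(indicator(n) for n in range(1, N + 1)) / N


if __name__ == "__main__":
    coprime_pairs = [(m, n) for m in range(1, 41) for n in range(1, 41) if gcd(m, n) == 1]
    print(all(indicator(m * n) == indicator(m) * indicator(n) for m, n in coprime_pairs))
    print(squarefree_proportion(10**4), 6 / pi**2)
```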
Next, we give generalizations of Theorems A and B from the introduction. They are 
consequences of a result that we state next. A subset of the unit interval or the unit circle 
is called Riemann-measurable if its indicator function is a Riemann integrable function. 
It is known that this condition is equivalent to having boundary of Lebesgue measure 0. 

Theorem 1.2. Let $f\in\mathcal{M}$ be a multiplicative function taking values on the unit circle.

(i) If for some $k\in\mathbb{N}$ the function f takes values on k-th roots of unity and K is a
non-empty subset of its range, then $f^{-1}(K)$ is a set of polynomial multiple mean
convergence. If in addition $f^j$ is aperiodic for $j=1,\ldots,k-1$, then any shift of
the set $f^{-1}(K)$ is a set of polynomial multiple recurrence.

(ii) If $f^j$ is aperiodic for all $j\in\mathbb{N}$ and K is a Riemann-measurable subset of the unit
circle of positive measure, then any shift of the set $f^{-1}(K)$ is a set of polynomial
multiple recurrence and polynomial multiple mean convergence.

In fact, under the aperiodicity assumptions of part (i) or (ii) we get that the set $f^{-1}(K)$
is Gowers uniform (see definition in Section 2.3).

We denote by $\omega(n)$ the number of distinct prime factors of an integer n and by $\Omega(n)$
the number of prime factors of n counted with multiplicity. We let

$$S_{\omega,A,b}:=\{n\in\mathbb{N}\colon \omega(n)\equiv a \bmod b \text{ for some } a\in A\}$$

and similarly we define $S_{\Omega,A,b}$.

Corollary 1.3. For every $b\in\mathbb{N}$ and $A\subset\{0,\ldots,b-1\}$ non-empty, any shift of the
sets $S_{\omega,A,b}$ and $S_{\Omega,A,b}$ is a set of polynomial multiple recurrence and polynomial multiple
mean convergence. In fact, all these sets are Gowers uniform.

Remark. As the polynomial $2^a n^b$ takes values in $S_{\Omega,a,b}$, the multiple recurrence property
(5) for some $n\in S_{\Omega,a,b}$ can be inferred from the polynomial Szemerédi theorem. This
argument does not apply for non-trivial shifts of the sets $S_{\Omega,a,b}$ and $S_{\omega,a,b}$. On the other
hand, see Section HTUl for an alternate argument that can be used to prove “linear” multiple 
recurrence statements by finding IP^-patterns within any shift of Si,j^a,b and SQ^a,b- 
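
The first sentence of this remark rests on the identity $\Omega(2^a n^b)=a+b\,\Omega(n)$, which places $2^a n^b$ in the class of integers with $\Omega\equiv a \bmod b$. A short sketch with a hypothetical helper `big_omega` confirms this on a small range:

```python
def big_omega(n):
    """Number of prime factors of n counted with multiplicity."""
    count, p = 0, 2
    while p * p <= n:
        while n % p == 0:
            n //= p
            count += 1
        p += 1
    return count + (1 if n > 1 else 0)


def lands_in_class(a, b, n):
    """True if Omega(2^a * n^b) is congruent to a mod b."""
    return big_omega(2**a * n**b) % b == a % b


if __name__ == "__main__":
    print(all(lands_in_class(a, b, n)
              for b in (2, 3, 4) for a in range(b) for n in range(1, 51)))
```

The same computation fails for $\omega$ in place of $\Omega$ (for example $\omega(2^a n^b)=\omega(2n)$ when $a\ge 1$), and it does not survive shifting the set, in line with the remark.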

For $\alpha\in\mathbb{R}$ and $A\subset[0,1/2]$ we let

$$S_{\omega,A,\alpha}:=\{n\in\mathbb{N}\colon \|\omega(n)\alpha\|\in A\},\qquad S_{\Omega,A,\alpha}:=\{n\in\mathbb{N}\colon \|\Omega(n)\alpha\|\in A\},$$

where $\|x\|:=d(x,\mathbb{Z})$ for $x\in\mathbb{R}$.

Corollary 1.4. For every irrational $\alpha$ and Riemann-measurable set $A\subset[0,1/2]$ of
positive measure, any shift of the sets $S_{\omega,A,\alpha}$ and $S_{\Omega,A,\alpha}$ is a set of polynomial multiple
recurrence and mean convergence. In fact, all these sets are Gowers uniform.

Remark. Similar results hold if $S_{\omega,A,\alpha}$ and $S_{\Omega,A,\alpha}$ are defined using fractional parts.

¹ This can also be deduced directly from Theorem 2.2 by approximating S in density by periodic sets.

1.3. Combinatorial implications. We give some combinatorial implications of the
previous multiple recurrence results. We define the upper Banach density $d^*(E)$ of a set
$E\subset\mathbb{Z}^\ell$ as $d^*(E):=\limsup_{|I|\to\infty}\frac{|E\cap I|}{|I|}$, where the limsup is taken over all parallelepipeds
$I\subset\mathbb{Z}^\ell$ whose side lengths tend to infinity. We use the following modification of the
correspondence principle of H. Furstenberg (the proof can be found in [1]):

Furstenberg Correspondence Principle ([13]). Let $\ell\in\mathbb{N}$ and $E\subset\mathbb{Z}^\ell$. There exist
a probability space $(X,\mathcal{X},\mu)$, invertible commuting measure preserving transformations
$T_1,\ldots,T_\ell\colon X\to X$, and a set $A\in\mathcal{X}$ with $\mu(A)=d^*(E)$, such that

$$d^*\big((E-\vec{n}_1)\cap\cdots\cap(E-\vec{n}_m)\big)\ \ge\ \mu\Big(\Big(\prod_{i=1}^{\ell}T_i^{-n_{i,1}}\Big)A\cap\cdots\cap\Big(\prod_{i=1}^{\ell}T_i^{-n_{i,m}}\Big)A\Big)$$

for all $m\in\mathbb{N}$ and $\vec{n}_j=(n_{1,j},\ldots,n_{\ell,j})\in\mathbb{Z}^\ell$ for $j=1,\ldots,m$.

Using this result and Corollaries 1.3 and 1.4 we immediately deduce the following:

Theorem 1.5. Let $\ell,m\in\mathbb{N}$, $\vec{q}_1,\ldots,\vec{q}_m\colon\mathbb{Z}\to\mathbb{Z}^\ell$ be polynomials with $\vec{q}_i(0)=\vec{0}$ for
$i=1,\ldots,m$, and let $E\subset\mathbb{Z}^\ell$ with $d^*(E)>0$. Then the set

$$\{n\in\mathbb{N}\colon d^*\big((E-\vec{q}_1(n))\cap\cdots\cap(E-\vec{q}_m(n))\big)>0\}$$

intersects any shift of the sets $S_{\omega,A,b}$, $S_{\Omega,A,b}$, $S_{\omega,A,\alpha}$, $S_{\Omega,A,\alpha}$ (we assume that the set A
satisfies the assumptions of Corollaries 1.3 and 1.4) on a set of positive lower density.

1.4. Pointwise convergence. Variants of the previous mean convergence results that
deal with pointwise convergence of multiple ergodic averages are, for the most part,
completely open. The situation is only clear for the single term ergodic averages

$$(6)\qquad \frac{1}{N}\sum_{n=1}^{N}f(n)\cdot F(T^nx),$$

where $F\in L^\infty(\mu)$. If f is the Möbius or the Liouville function, then it is shown in
[?, Proposition 3.1] that these averages converge pointwise to 0. This is done by combining
the spectral theorem with some classical quantitative bounds of H. Davenport [7] for
averages of the form $\frac{1}{N}\sum_{n=1}^{N}f(n)e(nt)$; note though that such bounds do not hold for
general aperiodic multiplicative functions.

For more general $f\in\mathcal{M}_{conv}$ we can treat pointwise convergence of the averages (6)
as follows: If F is orthogonal to the Kronecker factor of the system, then for every
$f\in\mathcal{M}$ the averages (6) converge pointwise to 0. We can establish this by combining an
orthogonality criterion of I. Kátai [21] with a result of J. Bourgain [3]; the former implies
that the averages (6) converge to zero if $\frac{1}{N}\sum_{n=1}^{N}F(T^{an}x)\cdot\overline{F(T^{bn}x)}\to 0$ for every $a,b\in\mathbb{N}$
with $a\neq b$, and the latter confirms this property pointwise almost everywhere when F is
orthogonal to the Kronecker factor of the system. On the other hand, suppose that F
is an eigenfunction with eigenvalue $e(\alpha)$ for some $\alpha\in\mathbb{R}$. If $\alpha$ is irrational, then using a
result of H. Daboussi [?] we deduce that for all $f\in\mathcal{M}$ the averages (6) converge to 0
pointwise. If $\alpha$ is rational, then they converge for all $f\in\mathcal{M}_{conv}$. Furthermore, in either
case, the averages (6) converge to 0 if f is aperiodic. Combining the above and using
an approximation argument, we get that if $f\in\mathcal{M}_{conv}$, then the averages (6) converge
pointwise, and they converge to 0 if f is aperiodic. We deduce from this that all the sets
$S_{\omega,A,b}$, $S_{\Omega,A,b}$, $S_{\omega,A,\alpha}$, $S_{\Omega,A,\alpha}$ defined in Section 1.2 are good for pointwise convergence
of single ergodic averages, and under the obvious non-degeneracy assumptions for the set
A we get that for ergodic systems the normalized averages converge to $\int F\,d\mu$ for all
$F\in L^\infty(\mu)$. Furthermore, an approximation argument allows one to extend these results to
all $F\in L^1(\mu)$.

We record here a related open problem regarding multiple ergodic averages with arithmetic
weights (perhaps the simplest of this type).





Problem. Let $f\in\mathcal{M}_{conv}$ be a multiplicative function. Is it true that for every measure
preserving system $(X,\mathcal{X},\mu,T)$, and every $F,G\in L^\infty(\mu)$, the averages

$$(7)\qquad \frac{1}{N}\sum_{n=1}^{N}f(n)\cdot F(T^nx)\cdot G(T^{2n}x)$$

converge pointwise? Do they converge to 0 if f is aperiodic?

When $f=1$ the averages (7) converge pointwise by a result of J. Bourgain [3]. In
general, the problem is open even when f is the Liouville function; that is, it is not known
whether the averages $\frac{1}{N}\sum_{n=1}^{N}\mathbf{1}_S(n)\cdot F(T^nx)\cdot G(T^{2n}x)$ converge pointwise when S is
the set of integers that have an even number of prime factors counted with multiplicity.

1.5. Notation and conventions. For the reader's convenience, we gather here some
notation that we use throughout the article. We denote by $\mathbb{N}$ the set of positive integers and
by $\mathbb{P}$ the set of prime numbers. For $N\in\mathbb{N}$ we let $\mathbb{Z}_N:=\mathbb{Z}/N\mathbb{Z}$ and $[N]:=\{1,\ldots,N\}$.
We let $e(t):=e^{2\pi i t}$. With $o_{N\to\infty}(1)$ we denote a quantity that converges to 0 when
$N\to\infty$ and all other implicit parameters are fixed. Given transformations $T_i\colon X\to X$,
$i=1,\ldots,\ell$, with $\prod_{i=1}^{\ell}T_i$ we denote the composition $T_1\circ\cdots\circ T_\ell$. We use the letter f
to denote a multiplicative function. A Dirichlet character, denoted by $\chi$, is a completely
multiplicative function that is periodic and satisfies $\chi(1)=1$.

1.6. Acknowledgement. We would like to thank M. Lemanczyk for helpful remarks. 


2. Main ingredients 


2.1. Multiple recurrence and convergence results. In order to prove our main 
results we will use some well known multiple recurrence and convergence results in ergodic 
theory. The first is the polynomial Szemeredi theorem stated in ergodic terms. 

Theorem 2.1 (Bergelson, Leibman [1]). The set of positive integers is a set of polynomial 
multiple recurrence. 

The second is a mean convergence result for multiple ergodic averages. 

Theorem 2.2 (Walsh [23]). The set of positive integers is a set of polynomial multiple
mean convergence.


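2.2. Gowers norms and estimates. We recall the definition of the $U^s$-Gowers uniformity
norms from [?].

Definition (Gowers norms on $\mathbb{Z}_N$ [?]). Let $N\in\mathbb{N}$ and $a\colon\mathbb{Z}_N\to\mathbb{C}$. For $s\in\mathbb{N}$ the
Gowers $U^s(\mathbb{Z}_N)$-norm $\|a\|_{U^s(\mathbb{Z}_N)}$ of a is defined inductively as follows: For every $t\in\mathbb{Z}_N$
we write $a_t(n):=a(n+t)$. We let

$$\|a\|_{U^1(\mathbb{Z}_N)}:=\Big|\frac{1}{N}\sum_{n\in\mathbb{Z}_N}a(n)\Big|$$

and for every $s\in\mathbb{N}$ we let

$$\|a\|_{U^{s+1}(\mathbb{Z}_N)}:=\Big(\frac{1}{N}\sum_{t\in\mathbb{Z}_N}\|a\cdot\overline{a_t}\|_{U^s(\mathbb{Z}_N)}^{2^s}\Big)^{1/2^{s+1}}.$$

If $a\colon\mathbb{N}\to\mathbb{C}$ is an infinite sequence, then by $\|a\|_{U^s(\mathbb{Z}_N)}$ we denote the $U^s(\mathbb{Z}_N)$-norm of
the restriction of a to the interval $[N]$, thought of as a function on $\mathbb{Z}_N$.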
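
The inductive definition translates directly into a recursive computation for small N. The following is a sketch under the standard convention that the recursion uses the product of a with the complex conjugate of its shift $a_t$; the function names are hypothetical, and the running time grows like $N^s$, so it is meant for toy examples only. As a sanity check it also evaluates the well known identity $\|a\|_{U^2(\mathbb{Z}_N)}^4=\sum_{\xi}|\hat a(\xi)|^4$.

```python
import cmath


def u_norm_power(a, s):
    """Return ||a||_{U^s(Z_N)}^(2^s) for a list a of N numbers, following the recursion."""
    N = len(a)
    if s == 1:
        return abs(sum(a) / N) ** 2  # ||a||_{U^1}^2 = |average of a|^2
    total = 0.0
    for t in range(N):
        # a(n) times the conjugate of the shift a_t(n) = a(n + t), indices taken mod N
        product = [a[n] * complex(a[(n + t) % N]).conjugate() for n in range(N)]
        total += u_norm_power(product, s - 1)
    return total / N


def gowers_norm(a, s):
    """The U^s(Z_N) norm of a."""
    return u_norm_power(a, s) ** (1.0 / 2**s)


def fourier_u2_power(a):
    """Sum over xi of |a_hat(xi)|^4 with a_hat(xi) = (1/N) sum_n a(n) e(-n xi / N)."""
    N = len(a)
    total = 0.0
    for xi in range(N):
        coeff = sum(a[n] * cmath.exp(-2j * cmath.pi * n * xi / N) for n in range(N)) / N
        total += abs(coeff) ** 4
    return total


if __name__ == "__main__":
    N = 7
    quadratic_phase = [cmath.exp(2j * cmath.pi * n * n / N) for n in range(N)]
    print(gowers_norm(quadratic_phase, 2), gowers_norm(quadratic_phase, 3))
```

For the quadratic phase $e(n^2/7)$ the $U^2$ norm is small while the $U^3$ norm equals 1, which illustrates why norms of higher order are needed to control polynomial averages.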


The following uniformity estimates will be used to analyze the limiting behavior of 
multiple ergodic averages. 

Lemma 2.3 (Uniformity estimates [12, Lemma 3.5]). Let $\ell,m\in\mathbb{N}$, $(X,\mathcal{X},\mu)$ be a
probability space, $T_1,\ldots,T_\ell\colon X\to X$ be invertible commuting measure preserving
transformations, $F_1,\ldots,F_m\in L^\infty(\mu)$ be functions bounded by 1, and $p_{i,j}\colon\mathbb{Z}\to\mathbb{Z}$, $i\in\{1,\ldots,\ell\}$,
$j\in\{1,\ldots,m\}$, be polynomials. Let $w\colon\mathbb{N}\to\mathbb{C}$ be a sequence of complex numbers that
is bounded by 1. Then there exists $s\in\mathbb{N}$, depending only on the maximum degree of the
polynomials $p_{i,j}$ and the integers $\ell$ and m, such that

$$\Big\|\frac{1}{N}\sum_{n=1}^{N}w(n)\cdot\Big(\prod_{i=1}^{\ell}T_i^{p_{i,1}(n)}\Big)F_1\cdots\Big(\prod_{i=1}^{\ell}T_i^{p_{i,m}(n)}\Big)F_m\Big\|_{L^2(\mu)}\ \ll\ \big\|\mathbf{1}_{[N]}\cdot w\big\|_{U^s(\mathbb{Z}_{sN})}+o_N(1).$$

Furthermore, the implicit constant and the $o_N(1)$ term depend only on the integer s.


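We also need the following result which follows from Lemma A.1 and A.2 in [11].

Lemma 2.4. Let $s\ge 2$ be an integer and $\varepsilon,\kappa>0$. Then there exist $\delta>0$ and $N_0\in\mathbb{N}$,
such that for all integers $N,\tilde{N}$ with $N_0\le N\le\tilde{N}\le\kappa N$, every interval $J\subset[N]$, and
$f\colon\mathbb{Z}_{\tilde{N}}\to\mathbb{C}$ with $|f|\le 1$, the following implication holds:

if $\|f\|_{U^s(\mathbb{Z}_{\tilde{N}})}\le\delta$, then $\|\mathbf{1}_J\cdot f\|_{U^s(\mathbb{Z}_N)}\le\varepsilon$.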

2.3. Gowers uniform sets. We introduce here the notion of a Gowers uniform subset 
of the integers that was used repeatedly in the statements of our main results. 


Definition. We say that a set of positive integers S is Gowers uniform if there exists a
positive constant c such that

$$\lim_{N\to\infty}\|\mathbf{1}_S-c\|_{U^s(\mathbb{Z}_N)}=0$$

for every $s\in\mathbb{N}$.


Remark. If such a constant exists, then applying the defining property for s = 1 gives 
that c is the density of the set S. 


If S is a Gowers uniform set, then applying Lemma 2.3 for the weight $w(n)=\mathbf{1}_S(n)-c$,
$n\in\mathbb{N}$, and combining the definition of Gowers uniformity with Lemma 2.4 we deduce
that for

$$V_n:=\Big(\prod_{i=1}^{\ell}T_i^{p_{i,1}(n)}\Big)F_1\cdots\Big(\prod_{i=1}^{\ell}T_i^{p_{i,m}(n)}\Big)F_m$$

we have

$$\frac{1}{|S\cap[N]|}\sum_{n\in S\cap[N]}V_n-\frac{1}{N}\sum_{n=1}^{N}V_n\ \to\ 0\quad\text{in } L^2(\mu).$$

Using this, the recurrence result of Theorem 2.1 and the convergence result of
Theorem 2.2 we deduce the following:

Proposition 2.5. Suppose that the set $S\subset\mathbb{N}$ is Gowers uniform. Then any shift of S
is a set of polynomial multiple recurrence and polynomial multiple mean convergence.


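2.4. Structure theorem for multiplicative functions and aperiodicity. Next we
state a structural result from [11] that is crucial for our study. We first introduce some
notation from [11, Section 3]. Given $f\colon\mathbb{N}\to\mathbb{C}$ and $N\in\mathbb{N}$ we let

$$f_N:=f\cdot\mathbf{1}_{[N]}$$

and whenever appropriate we consider $f_N$ as a function in $\mathbb{Z}_N$. The Fourier transform
$\hat{f}_N$ of $f_N$ is defined by

$$\hat{f}_N(\xi):=\frac{1}{N}\sum_{n\in\mathbb{Z}_N}f_N(n)\,e\Big(-n\frac{\xi}{N}\Big),\qquad \xi\in\mathbb{Z}_N.$$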
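
By a kernel on $\mathbb{Z}_N$ we mean a non-negative function on $\mathbb{Z}_N$ with average 1. For every
prime number N and $\theta>0$, in [11] we defined two positive integers $Q=Q(\theta)$ and
$V=V(\theta)$, and for $N>2QV$, a function $\phi_{N,\theta}\colon\mathbb{Z}_N\to\mathbb{C}$ given by the formula

$$\phi_{N,\theta}(x):=\sum_{\xi\in\Xi_{N,\theta}}\Big(1-\Big\|\frac{Q\xi}{N}\Big\|\frac{N}{QV}\Big)\,e\Big(x\frac{\xi}{N}\Big),$$

where

$$(8)\qquad \Xi_{N,\theta}:=\Big\{\xi\in\mathbb{Z}_N\colon\Big\|\frac{Q\xi}{N}\Big\|<\frac{QV}{N}\Big\}.$$

Then for every $\xi\in\mathbb{Z}_N$ we have

$$(9)\qquad \hat{\phi}_{N,\theta}(\xi)=\begin{cases}1-\Big\|\dfrac{Q\xi}{N}\Big\|\dfrac{N}{QV} & \text{if } \xi\in\Xi_{N,\theta};\\[2mm] 0 & \text{otherwise.}\end{cases}$$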


Theorem 2.6 (Structure theorem for multiplicative functions [11, Theorem 8.1]). Let
$s\in\mathbb{N}$ and $\varepsilon>0$. Then there exist a real number $\theta>0$ and $N_0\in\mathbb{N}$, depending on s and
$\varepsilon$ only, such that for every prime $N\ge N_0$, every $f\in\mathcal{M}$ admits the decomposition

$$f(n)=f_{N,st}(n)+f_{N,un}(n),\quad\text{for every } n\in[N],$$

where $f_{N,st},f_{N,un}\colon[N]\to\mathbb{C}$ are bounded by 1 and 2 respectively and satisfy:

(i) $f_{N,st}=f_N*\phi_{N,\theta}$ where $\phi_{N,\theta}$ is the kernel on $\mathbb{Z}_N$ defined previously and the
convolution product is defined in $\mathbb{Z}_N$;

(ii) $\|f_{N,un}\|_{U^s(\mathbb{Z}_N)}\le\varepsilon$.

Remark. In [11] this result is stated with f multiplied by a certain cut-off. The cut-off
is not needed for our purposes and exactly the same argument proves the current version.

We think of $f_{N,st}$ and $f_{N,un}$ as the structured and uniform component of f respectively.
From this point on we assume that $N>2QV$. When convenient we identify $\mathbb{Z}_N$
with the set $\{0,\ldots,N-1\}$ and we denote by $(a,b) \bmod N$ the set that consists of those
$\xi\in\mathbb{Z}_N$ such that $\xi+kN\in(a,b)$ for some $k\in\mathbb{Z}$. Note that $\xi\in\Xi_{N,\theta}$ if and only if there
exists $p\in\mathbb{Z}$ such that $\xi-\frac{p}{Q}N\in(-V,V) \bmod N$. Hence,

$$\Xi_{N,\theta}=\bigcup_{p=0}^{Q-1}\Big(\frac{p}{Q}N-V,\ \frac{p}{Q}N+V\Big) \bmod N.$$

We may choose to include or omit the endpoints of each interval (if they are integers),
since for these values the Fourier transform of $\phi_{N,\theta}$ is 0. Hence, we can assume that

$$(10)\qquad \Xi_{N,\theta}=\bigcup_{p=0}^{Q-1}\Xi_{N,\theta,p}$$

where for $p=0,\ldots,Q-1$ we have

$$\Xi_{N,\theta,p}:=\Big\{\Big\lfloor\frac{p}{Q}N\Big\rfloor+j \bmod N\colon -V<j\le V\Big\}.$$

Note that for fixed $N>2QV$ and $\theta>0$ the sets $\Xi_{N,\theta,p}$, $p=0,\ldots,Q-1$, are disjoint,
each of cardinality 2V, hence $|\Xi_{N,\theta}|=2QV$. Furthermore, if $N\equiv 1 \bmod Q$, then

$$(11)\qquad \Xi_{N,\theta,p}=\Big\{\frac{p}{Q}(N-1)+j \bmod N\colon -V<j\le V\Big\}.$$

Restricting N to a specific congruence class mod Q is needed in the proof of Lemma 3.3.
We will also use the following consequence of Theorem 2.6; it can be derived by
combining Theorem 2.4 and Lemma A.1 in [11].

Theorem 2.7 (Aperiodic multiplicative functions [11]). Let $f\in\mathcal{M}$ be an aperiodic
multiplicative function and for $N\in\mathbb{N}$ let $I_N$ be a subinterval of $[N]$. Then

$$\lim_{N\to\infty}\|\mathbf{1}_{I_N}\cdot f\|_{U^s(\mathbb{Z}_N)}=0\quad\text{for every } s\in\mathbb{N}.$$

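2.5. Halász's theorem and consequences. To facilitate exposition, we define the
distance between two multiplicative functions as in [18]:

Definition. If $f,g\in\mathcal{M}$ we let $\mathbb{D}\colon\mathcal{M}\times\mathcal{M}\to[0,\infty]$ be given by

$$\mathbb{D}(f,g)^2=\sum_{p\in\mathbb{P}}\frac{1}{p}\Big(1-\mathrm{Re}\big(f(p)\overline{g(p)}\big)\Big).$$

Remark. Note that if $|f|=|g|=1$, then $\mathbb{D}(f,g)^2=\frac{1}{2}\sum_{p\in\mathbb{P}}\frac{1}{p}|f(p)-g(p)|^2$.

It can be shown (see [18]) that $\mathbb{D}$ satisfies the triangle inequality

$$\mathbb{D}(f,g)\le\mathbb{D}(f,h)+\mathbb{D}(h,g).$$

Also for all $f_1,f_2,g_1,g_2\in\mathcal{M}$ we have (see [?, Lemma 3.1])

$$(12)\qquad \mathbb{D}(f_1f_2,g_1g_2)\le\mathbb{D}(f_1,g_1)+\mathbb{D}(f_2,g_2).$$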
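
The distance $\mathbb{D}$ is an infinite sum over primes, but its truncation to primes $p\le x$ is easy to compute and already shows the behavior used below. The sketch that follows is illustrative: the function names and the truncation parameter `x` are assumptions made here, and a truncated sum can only suggest whether the full distance is finite or infinite.

```python
import cmath
import math


def primes_up_to(x):
    """List of primes p <= x (sieve of Eratosthenes), for x >= 2."""
    sieve = [True] * (x + 1)
    sieve[0:2] = [False, False]
    for p in range(2, int(x**0.5) + 1):
        if sieve[p]:
            for q in range(p * p, x + 1, p):
                sieve[q] = False
    return [p for p in range(2, x + 1) if sieve[p]]


def truncated_distance(f, g, x):
    """sqrt( sum_{p <= x} (1 - Re(f(p) * conj(g(p)))) / p ); f, g are evaluated at primes only."""
    total = sum(
        (1 - (complex(f(p)) * complex(g(p)).conjugate()).real) / p
        for p in primes_up_to(x)
    )
    return math.sqrt(max(total, 0.0))


if __name__ == "__main__":
    one = lambda p: 1
    n_it = lambda p: cmath.exp(1j * 1.0 * math.log(p))  # n^{it} with t = 1
    for x in (10**2, 10**3, 10**4):
        print(x, truncated_distance(one, n_it, x))
```

For functions of modulus 1 the truncated distance is a weighted $\ell^2$ norm of the sequence $f(p)-g(p)$, so the triangle inequality and (12) hold for every truncation as well.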

We will also use that if $f\in\mathcal{M}$ is such that for some c in the unit circle we have $f(p)=c$
for all primes p, then $\mathbb{D}(f,n^{it})=\infty$ for every $t\neq 0$. In particular we have $\mathbb{D}(1,n^{it})=\infty$
for every $t\neq 0$. Using this and the triangle inequality, one deduces that for $f\in\mathcal{M}$ we
have $\mathbb{D}(f,n^{it})<\infty$ for at most one value of $t\in\mathbb{R}$. We will use the following celebrated
result of G. Halász:

Theorem 2.8 (Halász [19]). A multiplicative function $f\in\mathcal{M}$ has mean value zero if
and only if for every $t\in\mathbb{R}$ we either have $\mathbb{D}(f,n^{it})=\infty$ or $f(2^k)=-2^{ikt}$ for all $k\in\mathbb{N}$.

Remark. Since f is aperiodic if and only if for every Dirichlet character $\chi$ the
multiplicative function $f\cdot\chi$ has mean value zero, this result also gives necessary and sufficient
conditions for aperiodicity.

Another consequence of the mean value theorem of Halász (see for example [?, Theorem 6.3])
is the following result that gives easy to check necessary and sufficient conditions
for a multiplicative function to have a mean value (not necessarily zero).

Theorem 2.9. Let $f\in\mathcal{M}$. Then f has a mean value if and only if we either have

(i) $\mathbb{D}(f,n^{it})=\infty$ for every $t\in\mathbb{R}$, or

(ii) $\sum_{p\in\mathbb{P}}\frac{1}{p}(1-f(p))$ converges, or

(iii) For some $t\in\mathbb{R}$ we have $\mathbb{D}(f,n^{it})<\infty$ and $f(2^k)=-2^{ikt}$ for all $k\in\mathbb{N}$.

Remark. Since $f\in\mathcal{M}_{conv}$ if and only if for every Dirichlet character $\chi$ the multiplicative
function $f\cdot\chi$ has a mean value, this result also gives necessary and sufficient conditions
for a multiplicative function to be in $\mathcal{M}_{conv}$.

We deduce from the previous results the following criterion that will be used in the
proof of Theorem 1.2 and the proof of Corollary 1.3 and 1.4.

Proposition 2.10. Let $f\in\mathcal{M}$.

(i) If for some $k\in\mathbb{N}$, f takes values on the k-th roots of unity, then $f\in\mathcal{M}_{conv}$.

(ii) If $\alpha\in\mathbb{R}$ is not an integer and $f(p)=e(\alpha)$ for all $p\in\mathbb{P}$, then f is aperiodic.

Remarks. • Sharper results can be obtained using a theorem of R. Hall [?] and the
argument in [?, Corollary 2]. For instance, it can be shown that if $f(p)$ takes values in
a finite subset of the unit disc for all $p\in\mathbb{P}$, then $f\in\mathcal{M}_{conv}$, and if in addition $f(p)\neq 1$
for all $p\in\mathbb{P}$, then f is aperiodic.

• By taking averages in (3) and using that Part (ii) of the previous result implies
aperiodicity of $f^j$ for $j=1,\ldots,b-1$, we deduce that $d(S_{a,b})=\frac{1}{b}$.

Proof. We prove (i). It suffices to show that for every Dirichlet character $\chi$ the
multiplicative function $f\cdot\chi$ has a mean value. Note that $\chi$ takes values on roots of unity of
fixed order for all but finitely many primes (on which it is 0). Hence, it suffices to show
that if for some $m\in\mathbb{N}$ a multiplicative function g takes values on the m-th roots of unity
for all but finitely many primes, then g has a mean value. So let g be such a multiplicative
function. If $\mathbb{D}(g,n^{it})=\infty$ for every $t\in\mathbb{R}$, then we are done by Theorem 2.9. Suppose
that there exists $t\in\mathbb{R}$ such that $\mathbb{D}(g,n^{it})<\infty$. Using that $\mathbb{D}(1,n^{it})=\infty$ for every $t\neq 0$
we have that

$$m\,\mathbb{D}(g,n^{it})\ \ge\ \mathbb{D}(g^m,n^{imt})=\mathbb{D}(1,n^{imt})+O(1)=\infty,\quad\text{for every } t\neq 0,$$

where the lower bound follows from (12). Hence, $t=0$, which implies that

$$\sum_{p\in\mathbb{P}}\frac{1}{p}\big(1-\mathrm{Re}(g(p))\big)<\infty.$$

Since g takes finitely many values on the unit disc, there exists $c>0$ such that for all
$p\in\mathbb{P}$ we either have $g(p)=1$ or $1-\mathrm{Re}(g(p))\ge c$. Hence,

$$\sum_{p\in\mathbb{P},\ g(p)\neq 1}\frac{1}{p}<\infty.$$

Since $|1-g(p)|\le 2$ for all $p\in\mathbb{P}$, we deduce that

$$\sum_{p\in\mathbb{P}}\frac{1}{p}\,|1-g(p)|<\infty.$$

Theorem 2.9 again gives that g has a mean value, completing the proof of (i).

We prove (ii). Using the remark following Theorem 2.8 it suffices to show that for
every $t\in\mathbb{R}$ and Dirichlet character $\chi$ we have $\mathbb{D}(f\cdot\chi,n^{it})=\infty$. So let $\chi$ be a Dirichlet
character. Then there exists $m\in\mathbb{N}$ such that $(\chi(p))^m=1$ for all but a finite number of
primes p. Then using (12) we get

$$m\,\mathbb{D}(f\cdot\chi,n^{it})\ \ge\ \mathbb{D}(f^m\cdot\chi^m,n^{imt})=\mathbb{D}(f^m,n^{imt})+O(1)=\infty,\quad\text{for every } t\neq 0,$$

where the last distance is infinite since $f^m$ is constant on primes and $mt\neq 0$. It remains
to show that $\mathbb{D}(f\cdot\chi,1)=\infty$. Suppose that $\chi$ has period d. Since $\chi(1)=1$, we have
$\chi(n)=1$ whenever $n\equiv 1 \bmod d$, and since $f(p)=e(\alpha)$ for all $p\in\mathbb{P}$, we have

$$\mathbb{D}(f\cdot\chi,1)^2\ \ge\ \big(1-\cos(2\pi\alpha)\big)\cdot\sum_{p\in\mathbb{P}\cap(d\mathbb{Z}+1)}\frac{1}{p}=\infty,$$

where we used that $1-\cos(2\pi\alpha)\neq 0$ because $\alpha\notin\mathbb{Z}$, and the divergence of the last
series follows from Dirichlet's theorem. This completes the proof of (ii). □


3. Proof of main results 

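3.1. Proof of Theorem 1.1. We start with a few elementary lemmas.

Lemma 3.1. Let $(V_n)$ be a bounded sequence of elements of a normed space such that

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}V_n=V.$$

For $N\in\mathbb{N}$ let $\tilde{N}>N$ be integers such that the limit $\beta:=\lim_{N\to\infty}\frac{N}{\tilde{N}}$ exists. Then for
every $\alpha\in\mathbb{R}$ we have

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}e\Big(n\frac{\alpha}{\tilde{N}}\Big)V_n=c\cdot V$$

where $c=\frac{1}{\beta}\int_0^{\beta}e(\alpha y)\,dy$ if $\beta\neq 0$ and $c=1$ if $\beta=0$.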
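
Proof. For $n\in\mathbb{N}$ let

$$S_n:=\sum_{k=1}^{n}(V_k-V).$$

Our assumption gives that $S_n/n\to 0$ as $n\to\infty$. Using partial summation we get that
the norm of the average

$$\frac{1}{N}\sum_{n=1}^{N}e\Big(n\frac{\alpha}{\tilde{N}}\Big)(V_n-V)$$

is at most

$$\frac{1}{N}\sum_{n=2}^{N-1}\|S_n\|\cdot\Big|e\Big((n+1)\frac{\alpha}{\tilde{N}}\Big)-e\Big(n\frac{\alpha}{\tilde{N}}\Big)\Big|+o_{N\to\infty}(1).$$

Let $\varepsilon>0$. Since $S_n/n\to 0$ as $n\to\infty$, we have $\|S_n\|\le\varepsilon n$ for every sufficiently large n,
and thus the last expression is bounded by

$$\frac{1}{N}\sum_{n=2}^{N-1}\varepsilon n\,\frac{|2\pi\alpha|}{\tilde{N}}+o_{N\to\infty}(1)\ \le\ (|\pi\alpha|+1)\varepsilon+o_{N\to\infty}(1).$$

Since $\varepsilon$ is arbitrary, we get that

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}e\Big(n\frac{\alpha}{\tilde{N}}\Big)(V_n-V)=0.$$

Note also that if $\beta>0$, then

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}e\Big(n\frac{\alpha}{\tilde{N}}\Big)=\frac{1}{\beta}\int_0^{\beta}e(\alpha y)\,dy$$

and the previous limit is 1 if $\beta=0$. Combining the above we get the asserted claim. □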
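
Lemma 3.1 can be illustrated with a scalar sequence. The sketch below is a numerical example only, with hypothetical names: it takes $V_n=1+(-1)^n$ (whose averages converge to $V=1$), $\tilde{N}=2N$ (so $\beta=1/2$) and $\alpha=1$, and compares the twisted average with the constant $c=\frac{1}{\beta}\int_0^{\beta}e(\alpha y)\,dy$, evaluated in closed form as $(e(\alpha\beta)-1)/(2\pi i\alpha\beta)$.

```python
import cmath


def e(t):
    """e(t) = exp(2 pi i t)."""
    return cmath.exp(2j * cmath.pi * t)


def twisted_average(v, alpha, N, N_tilde):
    """(1/N) * sum_{n=1}^{N} e(n * alpha / N_tilde) * v(n)."""
    return sum(e(n * alpha / N_tilde) * v(n) for n in range(1, N + 1)) / N


def limit_constant(alpha, beta):
    """c = (1/beta) * integral_0^beta e(alpha y) dy; equals 1 when beta = 0 or alpha = 0."""
    if beta == 0 or alpha == 0:
        return 1.0 + 0j
    return (e(alpha * beta) - 1) / (2j * cmath.pi * alpha * beta)


if __name__ == "__main__":
    v = lambda n: 1 + (-1) ** n   # averages of v converge to V = 1
    for N in (10**2, 10**3, 10**4):
        print(N, twisted_average(v, 1.0, N, 2 * N), limit_constant(1.0, 0.5))
```

The oscillating part $(-1)^n$ averages out even after the slowly varying twist is inserted, which is exactly what the partial summation argument captures.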

Next we show that the discrete Fourier transform of elements of Mconv along certain 
“major arc” frequencies converges. 


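Lemma 3.2. Let $f\in\mathcal{M}_{conv}$, $Q\in\mathbb{N}$, and $p,\xi'\in\mathbb{Z}$. Then the averages

$$(13)\qquad \frac{1}{N}\sum_{n=1}^{N}f(n)\,e\Big(-n\Big(\frac{p}{Q}+\frac{\xi'}{QN}\Big)\Big)$$

converge.

Proof. Notice first that the expression in (13) is equal to

$$\frac{1}{N}\sum_{r=1}^{Q}e\Big(-r\Big(\frac{p}{Q}+\frac{\xi'}{QN}\Big)\Big)\sum_{n=1}^{\lfloor N/Q\rfloor}f(Qn+r)\,e\Big(-n\frac{\xi'}{N}\Big)+o_{N\to\infty}(1).$$

Hence, it suffices to show that for every fixed $Q,\xi'$, and $r\in[Q]$, the averages

$$\frac{1}{\lfloor N/Q\rfloor}\sum_{n=1}^{\lfloor N/Q\rfloor}f(Qn+r)\,e\Big(-n\frac{\xi'}{N}\Big)\,e\Big(-r\frac{\xi'}{QN}\Big)$$

converge. Since $e(-r\xi'/(QN))\to 1$ as $N\to\infty$, it suffices to show that the averages

$$(14)\qquad \frac{1}{\lfloor N/Q\rfloor}\sum_{n=1}^{\lfloor N/Q\rfloor}f(Qn+r)\,e\Big(-n\frac{\xi'}{N}\Big)$$

converge. Since $f\in\mathcal{M}_{conv}$, we have that the averages

$$\frac{1}{N}\sum_{n=1}^{N}f(Qn+r)$$

converge and using Lemma 3.1 for $a(n):=f(Qn+r)$, N replaced by $\lfloor N/Q\rfloor$, and $\tilde{N}$
replaced by N, we deduce the needed convergence for the averages (14). This completes
the proof. □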


Next we analyze the asymptotic behavior of the relevant ergodic averages with weights 
given by the structured components (defined by Theorem 12.61) of an element of Mconv 

Lemma 3.3. Let 9 > 0 , Q € N, and f G Mconv For N & let N > N be a prime 
that satisfies N = 1 mod Q and suppose that the limit fi := hmAr_).oo ^ exists. Let 
st e e defined by ([U]) and the convolution produet is defined in 

Then for every probability space {X, X, p), invertible commuting measure preserving 
transformations Ti,... ,T£: X ^ X, functions Fi,..., G L°°{p.), and polynomials 
Pij : Z —>■ Z, i = 1,... i, j = 1,..., m, the averages 


(15) 


1 


N 


L E/«(")■ (11 (Hr; 


r,Pi,m{n) 


N 


)F„ 


n=l 


i=l 


i=l 


converge in LP‘{pl). 

Proof. By the definition of and ^ we have that 

/ 7 V,stH= /iv( 0 <^ 7 v,e( 0 e(rai), n G [iV], 

where ^ is defined in ([8]). Since = 1 mod Q, it follows from (| 10 p and (ttH) that for 
N > 2QV if G g, then ^ can be uniquely represented as 

for some p £ {0,... ,Q — 1} and G ^ where for p = 0,1,...,Q — 1 we have 

E'p^e-={-P + jQ- - V<j<V}. 

Hence, it suffices to show that the averages m satisfy the asserted asymptotic when 
the (finite) sequence (/^g^(R))nG [TV] in (USD is replaced by the sequence 


' Q' “'''Q ' qn' 

for all p G {0, ..., <5 — 1} and f' G g. 

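Recall that if $f \colon \mathbb{Z}_{\tilde N} \to \mathbb{C}$ and $\xi \in \mathbb{Z}_{\tilde N}$, then $\hat f(\xi) = \frac{1}{\tilde N}\sum_{n \in \mathbb{Z}_{\tilde N}} f(n)\, e(-n\xi/\tilde N)$. By
Lemma 3.2 the limit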


lim fpfi^N + 
N^oo Q’ 


that 


exists and it follows from _ 

'Q*' ’ Q> * QR’ 

As a consequence, both terms can be factored out from the averaging operation. 
It remains to deal with the term $e(n(p/Q + \xi'/(Q\tilde N)))$. Let

\[ V_n := \Big(\prod_{i=1}^{\ell} T_i^{p_{i,1}(n)}\Big)F_1 \cdots \Big(\prod_{i=1}^{\ell} T_i^{p_{i,m}(n)}\Big)F_m, \quad n \in \mathbb{N}. \]

By Theorem 2.2 we have that the averages

\[ \frac{1}{N}\sum_{n=1}^{N} V_n \]

converge in $L^2(\mu)$. Using Lemma 3.1 we deduce that for all $p, \xi' \in \mathbb{Z}$, $Q \in \mathbb{N}$, the averages

« «' 




<3 <?iv 

converge in $L^2(\mu)$. This completes the proof. □

We are ready now to prove Theorem 1.1.


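Proof of Theorem 1.1. Let $\varepsilon > 0$. Without loss of generality we can assume that all
functions are bounded by 1. We let $s \ge 2$ be the integer and $C_s$ be the implicit constant
defined in Lemma 2.3. We apply Lemma 2.4 for $\varepsilon/(2C_s)$ in place of $\varepsilon$ and for $k := 2$. We
get that there exist $\delta > 0$ and $N_0 \in \mathbb{N}$ such that for all integers $N, \tilde N$ with $N_0 \le sN \le
\tilde N \le 2sN$ and $f \colon \mathbb{Z}_{\tilde N} \to \mathbb{C}$ with $|f| \le 1$, the following implication holds:

\[ \text{if } \|f\|_{U^s(\mathbb{Z}_{\tilde N})} \le \delta \text{ then } \|\mathbf{1}_{[N]} \cdot f\|_{U^s(\mathbb{Z}_{sN})} \le \frac{\varepsilon}{2C_s}. \tag{16} \]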

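We use the structural result of Theorem 2.6 for this $\delta$ in place of $\varepsilon$ and for the previously
defined $s$. We get that there exists $\theta = \theta(\delta, s) > 0$ such that for all large enough $N \in \mathbb{N}$,
if $\tilde N$ denotes the smallest prime such that $\tilde N > sN$ and $\tilde N \equiv 1 \bmod Q$ ($Q$ was introduced
in Section 2 and depends only on $\theta$), then we have the decomposition

\[ f(n) = f_{N,\mathrm{st}}(n) + f_{N,\mathrm{un}}(n), \quad n \in [N], \tag{17} \]

where $f_{N,\mathrm{st}} = f_N * \phi_{N,\theta}$ ($\phi_{N,\theta}$ defined by (9)) and

\[ \|f_{N,\mathrm{un}}\|_{U^s(\mathbb{Z}_{\tilde N})} \le \delta. \]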


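It follows from (16) that for all large enough $N$ we have

\[ \|\mathbf{1}_{[N]} \cdot f_{N,\mathrm{un}}\|_{U^s(\mathbb{Z}_{sN})} \le \frac{\varepsilon}{2C_s}. \tag{18} \]

Note also that the prime number theorem on arithmetic progressions implies that

\[ \lim_{N\to\infty} \frac{N}{\tilde N} = \frac{1}{s}. \]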

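For $\ell, m \in \mathbb{N}$ let $(X, \mathcal{X}, \mu)$ be a probability space, $T_1, \ldots, T_\ell \colon X \to X$ be invertible
commuting measure preserving transformations, $F_1, \ldots, F_m \in L^\infty(\mu)$, $p_{i,j} \colon \mathbb{Z} \to \mathbb{Z}$ be
polynomials where $i = 1, \ldots, \ell$, $j = 1, \ldots, m$. Let

\[ V_n := \Big(\prod_{i=1}^{\ell} T_i^{p_{i,1}(n)}\Big)F_1 \cdots \Big(\prod_{i=1}^{\ell} T_i^{p_{i,m}(n)}\Big)F_m, \quad n \in \mathbb{N}. \]

If $N \in \mathbb{N}$, for a given $a_N \colon [N] \to \mathbb{C}$ we define

\[ A_N(a_N) := \frac{1}{N}\sum_{n=1}^{N} a_N(n) \cdot V_n. \]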
Since $f_{N,\mathrm{un}}$ is bounded by 2 and the functions $F_i$ are bounded by 1, it follows from
Lemma 2.3 and (18) that

\[ \limsup_{N\to\infty} \|A_N(f_{N,\mathrm{un}})\|_{L^2(\mu)} \le \varepsilon. \]

Hence,

\[ \limsup_{N\to\infty} \|A_N(f) - A_N(f_{N,\mathrm{st}})\|_{L^2(\mu)} \le \varepsilon. \]

Furthermore, by Lemma 3.3 we have that the averages $A_N(f_{N,\mathrm{st}})$
converge in $L^2(\mu)$ as $N \to \infty$. Combining the above we deduce that the sequence $(A_N(f))$
is Cauchy in $L^2(\mu)$ and hence it converges in $L^2(\mu)$. Therefore, the sequence $(f(n))$ is a
good universal weight for polynomial multiple mean convergence.

Finally, we prove the last claim of Theorem 1.1. If the multiplicative function $f$ is
aperiodic, then by Theorem 2.7 we have that $\lim_{N\to\infty} \|\mathbf{1}_{[N]} \cdot f\|_{U^s(\mathbb{Z}_{sN})} = 0$ for every
$s \ge 2$. Using Lemma 2.3 we deduce that the averages $A_N(f)$ converge to 0 in $L^2(\mu)$.
This verifies the asserted convergence. □


3.2. Proof of Theorem 1.2.


Proof of part (i) of Theorem 1.2. Recall that the range of $f$ is contained in a set of the
form $R = \{1, \zeta, \ldots, \zeta^{k-1}\}$ where $\zeta$ is a root of unity of order $k$. Then

\[ \mathbf{1}_{f^{-1}(K)}(n) = \frac{1}{k}\sum_{z \in K}\sum_{j=0}^{k-1} z^{-j}\,(f(n))^{j}. \tag{19} \]
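As a sanity check on identity (19), the following minimal Python sketch evaluates its right-hand side numerically. It assumes the concrete choice $f(n) = \zeta^{\omega(n)}$ with $\zeta = e(1/k)$, as in the proof of Corollary 1.3; the helper names `omega` and `indicator_rhs` are illustrative and do not come from the paper.

```python
import cmath


def omega(n):
    """Number of distinct prime factors of n, computed by trial division."""
    count, p = 0, 2
    while p * p <= n:
        if n % p == 0:
            count += 1
            while n % p == 0:
                n //= p
        p += 1
    return count + (1 if n > 1 else 0)


def indicator_rhs(n, k, exponents):
    """Right-hand side of (19) for the assumed f(n) = zeta**omega(n).

    K is taken to be {zeta**a : a in exponents}, where the exponents are
    distinct residues modulo k (an assumption of this sketch).
    """
    zeta = cmath.exp(2j * cmath.pi / k)
    fn = zeta ** omega(n)
    total = 0
    for a in exponents:
        z = zeta ** a
        # Inner sum over j equals k if f(n) = z and 0 otherwise.
        total += sum(z ** (-j) * fn ** j for j in range(k)) / k
    return total
```

Up to floating-point error, `indicator_rhs(n, k, A)` should be 1 when $\omega(n) \bmod k$ lies in `A` and 0 otherwise, which is exactly the indicator on the left-hand side of (19).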

We establish the first claim of part (i). For $\ell, m \in \mathbb{N}$ let $(X, \mathcal{X}, \mu)$ be a probability
space, $T_1, \ldots, T_\ell \colon X \to X$ be invertible commuting measure preserving transformations,
$F_1, \ldots, F_m \in L^\infty(\mu)$, $p_{i,j} \colon \mathbb{Z} \to \mathbb{Z}$ be polynomials where $i = 1, \ldots, \ell$, $j = 1, \ldots, m$. Using
(19) we see that in order to verify the asserted convergence it suffices to show that for

\[ V_n := \Big(\prod_{i=1}^{\ell} T_i^{p_{i,1}(n)}\Big)F_1 \cdots \Big(\prod_{i=1}^{\ell} T_i^{p_{i,m}(n)}\Big)F_m, \quad n \in \mathbb{N}, \]

the averages

\[ \frac{1}{N}\sum_{n=1}^{N} (f(n))^{j} \cdot V_n \]

converge in $L^2(\mu)$ for $j = 0, \ldots, k-1$. This follows from the first part of Theorem 1.1
and the fact that under the stated assumptions on the range of $f$ we have $f^j \in \mathcal{M}_{\mathrm{conv}}$
for $j = 0, \ldots, k-1$ by part (i) of Proposition 2.10.

We establish now the second claim in part (i). Suppose that $f^j$ is aperiodic for
$j = 1, \ldots, k-1$. By Proposition 2.5 it suffices to show that the set $f^{-1}(K)$ is Gowers
uniform. Let $s \ge 2$ be an integer. We claim that

\[ \lim_{N\to\infty} \Big\| \mathbf{1}_{f^{-1}(K)} - \frac{|K|}{k} \Big\|_{U^s(\mathbb{Z}_N)} = 0. \]

Using (19) and the triangle inequality for the $U^s$-norms, we see that in order to verify
the claim it suffices to show that for $j = 1, \ldots, k-1$ we have $\lim_{N\to\infty} \|f^j\|_{U^s(\mathbb{Z}_N)} = 0$
for every $s \in \mathbb{N}$. Since $f^j$ is by assumption aperiodic for $j = 1, \ldots, k-1$, this follows
from Theorem 2.7. □
Proof of part (ii) of Theorem 1.2. We claim that if $F$ is a Riemann integrable function
on $\mathbb{T}$ with integral zero, then $((F \circ f)(n))$ is a Gowers uniform sequence, meaning,

\[ \lim_{N\to\infty} \|F \circ f\|_{U^s(\mathbb{Z}_N)} = 0, \quad \text{for every } s \in \mathbb{N}. \tag{20} \]

Applying this for $F := \mathbf{1}_K - m_{\mathbb{T}}(K)$, and using that $m_{\mathbb{T}}(K) > 0$, we deduce that the set
$f^{-1}(K)$ is Gowers uniform; hence by Proposition 2.5 it is a set of polynomial multiple
recurrence and mean convergence.

We verify now (20). Without loss of generality we can assume that $\|F\|_\infty \le 1/2$. Let
$s \in \mathbb{N}$ and $\varepsilon > 0$.

We first claim that the sequence $(f(n))$ is equidistributed on the unit circle. Indeed,
using Weyl's equidistribution criterion it suffices to show that for every non-zero $j \in \mathbb{Z}$
we have

\[ \lim_{N\to\infty} \frac{1}{N}\sum_{n=1}^{N} (f(n))^{j} = 0. \]

This follows at once since by assumption $f^j$ is aperiodic for $j \in \mathbb{N}$, hence it has average
0. Taking complex conjugates we get a similar property for all negative $j$ as well.

Since F is Riemann integrable, bounded by 1/2, and has zero mean, there exists a 
trigonometric polynomial P on T, bounded by 1, with zero constant term, such that 




Since (/(n)) is equidistributed in T and the function F — P is Riemann integrable, we 
deduce that 


n=l 


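Using this, the fact that $|F - P|$ is bounded by 2, and the estimate

\[ \|a\|_{U^s(\mathbb{Z}_N)}^{2^s} \le \|a\|_{L^\infty(\mathbb{Z}_N)}^{2^s-1}\, \|a\|_{L^1(\mathbb{Z}_N)}, \]

which can be easily proved using the inductive definition of the norms $U^s(\mathbb{Z}_N)$, we deduce
that

\[ \limsup_{N\to\infty} \|F \circ f - P \circ f\|_{U^s(\mathbb{Z}_N)} \le \varepsilon. \tag{21} \]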

We know by assumption that $f^j$ is aperiodic for all $j \in \mathbb{N}$ and taking complex conjugates
we get that $f^j$ is aperiodic for all non-zero $j \in \mathbb{Z}$. Hence, Theorem 2.7 gives that
$\lim_{N\to\infty} \|f^j\|_{U^s(\mathbb{Z}_N)} = 0$ for all non-zero $j \in \mathbb{Z}$. Since the trigonometric polynomial $P$ has
zero constant term, it follows by the triangle inequality that $\lim_{N\to\infty} \|P \circ f\|_{U^s(\mathbb{Z}_N)} = 0$.

From this and (21) we deduce that

\[ \limsup_{N\to\infty} \|F \circ f\|_{U^s(\mathbb{Z}_N)} \le \varepsilon. \]

Since $\varepsilon > 0$ is arbitrary, we get $\lim_{N\to\infty} \|F \circ f\|_{U^s(\mathbb{Z}_N)} = 0$ and the proof is complete. □


3.3. Proof of Corollaries 1.3 and 1.4.


Proof of Corollary 1.3. Let $b \ge 2$ be an integer, $\zeta$ be a root of unity of order $b$, and
$a \in \{0, \ldots, b-1\}$.

In order to deal with the set $S_{\omega,A,b}$ we define the multiplicative function $f_1$ by $f_1(p^k) =
\zeta$ for all $k \in \mathbb{N}$ and primes $p$. Using part (ii) of Proposition 2.10 we deduce that $f_1^j$ is
aperiodic for $j = 1, \ldots, b-1$. Applying Theorem 1.2 for this multiplicative function and
for $K := \{\zeta^a \colon a \in A\}$ we deduce the asserted claims for the set $S_{\omega,A,b}$.
In a similar fashion we prove the asserted claims for the set $S_{\Omega,A,b}$; the only difference
is that we apply Theorem 1.2 for the multiplicative function $f_2$ defined by $f_2(p^k) = \zeta^k$
for all $k \in \mathbb{N}$ and primes $p$. □
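The two multiplicative functions used in this proof can be written out concretely: $f_1(n) = \zeta^{\omega(n)}$ and $f_2(n) = \zeta^{\Omega(n)}$, where $\omega$ and $\Omega$ count prime factors without and with multiplicity. The Python sketch below is only an illustration under the assumption $\zeta = e(1/b)$; the function names (`prime_exponents`, `small_omega`, `big_omega`, `f1`, `f2`, `in_S`) are hypothetical helpers, not notation from the paper.

```python
import cmath


def prime_exponents(n):
    """Exponents in the prime factorization of n, found by trial division."""
    exps, p = [], 2
    while p * p <= n:
        if n % p == 0:
            e = 0
            while n % p == 0:
                n //= p
                e += 1
            exps.append(e)
        p += 1
    if n > 1:
        exps.append(1)  # one remaining prime factor with exponent 1
    return exps


def small_omega(n):
    """omega(n): number of distinct prime factors."""
    return len(prime_exponents(n))


def big_omega(n):
    """Omega(n): number of prime factors counted with multiplicity."""
    return sum(prime_exponents(n))


def f1(n, b):
    """f1(n) = zeta**omega(n) with the assumed zeta = e(1/b)."""
    return cmath.exp(2j * cmath.pi * small_omega(n) / b)


def f2(n, b):
    """f2(n) = zeta**Omega(n) with the assumed zeta = e(1/b)."""
    return cmath.exp(2j * cmath.pi * big_omega(n) / b)


def in_S(n, A, b, count=small_omega):
    """Membership in S_{omega,A,b} (or S_{Omega,A,b} when count=big_omega)."""
    return count(n) % b in A
```

For instance, with $b = 2$ and $A = \{0\}$, `in_S(n, {0}, 2)` tests whether $n$ has an even number of distinct prime factors, and $f_1(n) \in K = \{\zeta^a \colon a \in A\}$ holds exactly for those $n$.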

Proof of Corollary 1.4. In order to deal with the set $S_{\omega,A,\alpha}$ we define the multiplicative
function $f_1$ by $f_1(p^k) = e(\alpha)$ for all $k \in \mathbb{N}$ and primes $p$. Using part (ii) of Proposition 2.10
we deduce that $f_1^j$ is aperiodic for all $j \in \mathbb{N}$. Applying Theorem 1.2 for this
multiplicative function and for $K := \{e(t) \colon t \in (-A) \cup A\}$ we deduce the asserted claims
for the set $S_{\omega,A,\alpha}$.

In a similar fashion, we prove the asserted claims for the set $S_{\Omega,A,\alpha}$; the only difference
is that we apply Theorem 1.2 for the multiplicative function $f_2$ defined by $f_2(p^k) = e(k\alpha)$
for all $k \in \mathbb{N}$ and primes $p$. □

3.4. Extension to nilpotent groups. Essentially the same arguments used in the previous
subsections can be replicated in order to extend the main results of this article to
the case where the transformations $T_1, \ldots, T_\ell$ generate a nilpotent group. The only extra
difficulty that we do not address here is to prove a variant of the uniformity estimates
of Lemma 2.3 that deals with this more general setup. This requires a non-trivial modification
of the PET induction argument used in [12, Lemma 3.5] along the lines of the
argument used to prove [24, Theorem 4.2]. Assuming these estimates, and substituting the
convergence result of Theorem 2.2 with its nilpotent version (again due to M. Walsh)
and the multiple recurrence result of Theorem 2.1 with a result of S. Leibman [22], the
rest of the argument carries over without any change.

3.5. An alternate approach for weak convergence. If one is satisfied with analyzing
weak convergence of the multiple ergodic averages in our main results (which suffices for
proving multiple recurrence), then an alternate way to proceed is as follows: Using the
main result from [9] we get that sequences of the form $C(n) = \int F_0 \cdot T_1^{n}F_1 \cdots T_\ell^{n}F_\ell \, d\mu$,
$n \in \mathbb{N}$, can be decomposed in two terms, one that is an $\ell$-step nilsequence $(\mathcal{N}(n))$
and another that contributes negligibly in evaluating weighted averages of the form
$\frac{1}{N}\sum_{n=1}^{N} f(n)\, C(n)$. This reduces matters to analyzing the limiting behavior of averages
of the form $\frac{1}{N}\sum_{n=1}^{N} f(n)\, \mathcal{N}(n)$, a task that has been carried out in [11]. This
way one can prove a version of Theorem 1.1 and related corollaries that deal with weak
convergence, avoiding the full strength of the main structural result in [11].

3.6. An alternate approach for recurrence. We mention here an alternate way to
prove "linear" multiple recurrence results for shifts of the sets $S_{\omega,A,b}$, $S_{\Omega,A,b}$. After modifying
the argument below along the lines of the proof of part (ii) of Theorem 1.2 we get
similar results for the sets $S_{\omega,A,\alpha}$, $S_{\Omega,A,\alpha}$.

Definition. An $IP_k$-set of integers is a set of the form

\[ \{a_{i_1} + \cdots + a_{i_\ell} \colon 1 \le \ell \le k,\ i_1 < i_2 < \cdots < i_\ell\} \]

where $a_1, \ldots, a_k$ are distinct positive integers.

For example, an $IP_3$-set has the form $\{m, n, r, m+n, m+r, n+r, m+n+r\}$ with
$m, n, r \in \mathbb{N}$ distinct.
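The definition can be made concrete with a short Python sketch that enumerates all non-empty subset sums of the generators. The function name `ip_set` is an illustrative choice and not notation from the paper.

```python
from itertools import combinations


def ip_set(generators):
    """Return the IP_k-set generated by k distinct positive integers.

    The result is the set of sums a_{i_1} + ... + a_{i_l} over all
    non-empty index sets i_1 < ... < i_l.
    """
    gens = list(generators)
    if len(set(gens)) != len(gens) or any(a <= 0 for a in gens):
        raise ValueError("generators must be distinct positive integers")
    return {
        sum(combo)
        for size in range(1, len(gens) + 1)
        for combo in combinations(gens, size)
    }
```

With generators 1, 2, 4 the seven subset sums are all different, while with generators 1, 2, 3 some sums coincide (for example 3 = 1 + 2), so an $IP_k$-set may have fewer than $2^k - 1$ elements.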

Proposition 3.4. Let $d, \ell \in \mathbb{N}$ and $L_1, \ldots, L_\ell \colon \mathbb{N}^d \to \mathbb{N}$ be pairwise independent linear
forms. Let $b, c \in \mathbb{N}$, $a \in \{0, \ldots, b-1\}$, and $S_{a,b,c}$ be either $S_{\omega,a,b} + c$ or $S_{\Omega,a,b} + c$. Then
the set

\[ \{\mathbf{m} \in \mathbb{N}^d \colon L_1(\mathbf{m}), \ldots, L_\ell(\mathbf{m}) \in S_{a,b,c}\} \tag{22} \]

has density $b^{-\ell}$. Therefore, the set $S_{a,b,c}$ contains $IP_k$-sets of integers for every $k \in \mathbb{N}$.

Remark. Similar results hold for affine linear forms and all sets defined in Theorem 1.2.
Proof. Note first that for $n > c$ we have

\[ \mathbf{1}_{S_{a,b,c}}(n) = \frac{1}{b}\sum_{j=0}^{b-1} \zeta^{-aj}\,(f(n-c))^{j}, \tag{23} \]

where $f$ is the multiplicative function and $\zeta$ is the $b$-th root of unity defined in the proof
of Corollary 1.3. Note also that the density of the set in (22) is equal to

\[ \lim_{N\to\infty} \frac{1}{N^d}\sum_{\mathbf{m} \in [N]^d} \prod_{i=1}^{\ell} \mathbf{1}_{S_{a,b,c}}(L_i(\mathbf{m})); \tag{24} \]

the existence of the limit will be established momentarily. By m Theorem 1.1], if at 
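least one of the functions $f_1, \ldots, f_\ell \in \mathcal{M}$ is aperiodic, then

\[ \lim_{N\to\infty} \frac{1}{N^d}\sum_{\mathbf{m} \in [N]^d} \prod_{i=1}^{\ell} f_i(L_i(\mathbf{m})) = 0. \tag{25} \]

Using that $f^j$ is aperiodic for $j = 1, \ldots, b-1$, in conjunction with (23) and (25), we
deduce that the limit in (24) exists and is equal to $b^{-\ell}$. □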
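The density statement of Proposition 3.4 can be explored numerically in a small case. The sketch below is an illustration only: it assumes $d = 2$, $\ell = 2$, the unshifted set $S_{\omega,a,b}$, and the pairwise independent forms $L_1(\mathbf{m}) = m_1$ and $L_2(\mathbf{m}) = m_1 + m_2$, none of which are singled out in the proof. The names `omega` and `empirical_density` are hypothetical helpers.

```python
from fractions import Fraction


def omega(n):
    """Number of distinct prime factors of n, computed by trial division."""
    count, p = 0, 2
    while p * p <= n:
        if n % p == 0:
            count += 1
            while n % p == 0:
                n //= p
        p += 1
    return count + (1 if n > 1 else 0)


def empirical_density(N, a, b):
    """Proportion of (m1, m2) in [N]^2 with omega(m1) and omega(m1 + m2)
    both congruent to a modulo b, returned as an exact fraction."""
    # Index 0 is a placeholder so that residues[n] refers to the integer n.
    residues = [None] + [omega(n) % b for n in range(1, 2 * N + 1)]
    hits = sum(
        1
        for m1 in range(1, N + 1)
        for m2 in range(1, N + 1)
        if residues[m1] == a and residues[m1 + m2] == a
    )
    return Fraction(hits, N * N)
```

For $b = 2$ the proposition gives limiting density $b^{-\ell} = 1/4$. The values of `empirical_density(N, a, 2)` for moderate $N$ are only indicative, since the convergence may be slow.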

Theorem 3.5 (Furstenberg, Katznelson [15, Theorem 10.1]). Let $T_1, \ldots, T_\ell$ be commuting
measure preserving transformations acting on the same probability space $(X, \mathcal{X}, \mu)$.
Then for every $A \in \mathcal{X}$ with $\mu(A) > 0$ there exists $k \in \mathbb{N}$ such that the set

\[ \{n \in \mathbb{N} \colon \mu(T_1^{-n}A \cap \cdots \cap T_\ell^{-n}A) > 0\} \]

intersects non-trivially every $IP_k$-set of integers.

Combining Proposition 3.4 with Theorem 3.5 we get that the set of return times in
Theorem 3.5 intersects non-trivially each of the sets $S_{\omega,a,b,c}$ and $S_{\Omega,a,b,c}$. Unfortunately,
a polynomial extension of Theorem 3.5 is not yet available and we cannot get the full
strength of the recurrence results of Corollaries 1.3 and 1.4 using such methods.